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Abstract 


Folding of the capped LQQLLQQLLQL peptide is investigated at the water-hexane interface by 
molecular dynamics simulations over 161.5 nanoseconds. Initially placed in the aqueous phase 
as a /3-strand, the peptide rapidly adsorbs to the interface, where it adopts an amphipathic 
conformation. The marginal presence of non-amphipathic structures throughout the complete 
trajectory indicates that the corresponding conformations are strongly disfavored at the in- 
terface. It is further suggestive that folding in an interfacial environment proceeds through a 
pathway of successive amphipathic intermediates. The energetic and entropic penalties involved 
in the conformational changes along this pathway markedly increase the folding time-scales of 
LQQLLQQLLQL, explaining why the a-helix, the hypothesized lowest free energy structure for a 
sequence with a hydrophobic periodicity of 3.6, has not been reached yet. The formation of a 
type I /3-turn at the end of the simulation confirms the importance of such motifs as initiation 
sites allowing the peptide to coalesce towards a secondary structure. 


Key words: Molecular dynamics simulations; amphipathic peptides; aqueous interfaces; pep- 
tide folding; free energy calculations 
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Introduction 




Many proteins, involved in such essential cellular functions as energy transduction, signal trans- 
mission, transport of nutrients and waste products, cell fusion and narcosis, and catalysis of 
some metabolites, interact with cell membranes either by binding to their surfaces or by insert- 
ing themselves into the bilayer. 1,2 Integral membrane proteins are usually large and complex, 
and can contain both water-soluble and transmembrane regions. Whereas the former are usu- 
ally built of diverse and flexible secondary structure elements, structural motifs characteristic 
of the latter are relative simple — viz. most often, either bundles of o-helices or /2-barrels. 3 
Typically, transmembrane elements of the secondary structure are quite stable, preserving their 
integrity in the membrane even if they are separated from the rest of the protein. 4-6 Moreover, 
the transmembrane segment alone, or with just a few additional residues, can retain the essen- 
tial biological activity of the whole protein, 4 ’ 5,7 ’® which is in sharp contrast with water-soluble 
proteins that only rarely preserve their structure or function upon truncation. This point is 
further reinforced by findings that some simple, synthetic peptides in the membrane can per- 
form functions normally reserved to large proteins. 7,9,10 In view of the considerable difficulties 
in obtaining high resolution, three dimensional structures of transmembrane proteins, this is 
very fortunate because it allows us to investigate simple peptide models with some faith in their 
biological relevance. 

The structural stability of simple membrane peptides can also be used profitably to study 
protein folding, especially the early events leading to the formation of secondary structure 
elements. These events cannot be investigated conveniently using peptides dissolved in water, 
because only very few of them remain in a well-defined, ordered conformation. In contrast, 
membrane peptides, which are also usually disordered in water, 11-14 fold prior to, during or 
after insertion into the bilayer. Although the exact relation between folding and insertion has 
not been unequivocally established, most experimental 15 and theoretical 16-18 results support a 
hypothesis 15,19,20 that elements of the secondary structure are formed at the water-membrane 
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interface, and, only then, does the protein partition into the bilayer. Transmembrane proteins 
share this property of interfacial folding with peptides that reside at the membrane surface. 
These peptides are quite interesting in their own rights since they include many hormones, 
toxins, antibiotics and membrane fusion proteins. 

Similarities between transmembrane and surface peptides often extend much farther than fold- 
ing behavior, and are manifested in the peptide sequences, many of which contain periodically 
spaced polar and non-polar amino acids. Possessing the appropriate hydrophobic periodicity, 
they can form a-helices, in which polar and non-polar residues are located at opposite faces. 
At the water-membrane interface, such an amphipathic helix can readily adopt an orienta- 
tion in which the polar face is exposed to water, while the non-polar face is buried in tfie 
hydrophobic environment of the hydrocarbon tails in the core of the bilayer. This match be- 
tween the polarities of the peptide and the environment should render the amphipathic a-helix 
particularly stable. Numerous experimental studies confirm that naturally occurring 23-26 and 
synthetic 13 ’ 27-31 peptides, as well as fragments of larger proteins , 32-35 which have sequences 
capable of forming amphipathic a-helices, indeed, do so. Typically, these peptides adopt orien- 
tations parallel to the interface . 26 - 36-39 At sufficiently high concentrations and/or in the presence 
of an electric field, some of them, however, rotate to the transmembrane orientation , 10,26 ' 29,40 
and may associate to form channel-like bundles with their polar faces lining the pore and their 
non-polar faces exposed to the hydrophobic environment in the membrane. 

In this paper, we investigate the mechanism by which a simple peptide folds into an a-helical 
structure. In line with the above discussion, this is done at an interfacial environment, so that 
the results are relevant to both surface and transmembrane proteins. As a model, we have 
selected the Ac- and NHMe- terminally blocked undecamer built of L-leucine and L-glutamine 
residues. We have chosen the LQQLLQQLLQL sequence because it becomes amphipathic in the 
a-helical conformation. Although there are no experimental data for this particular peptide, a 
large body of evidence obtained for other amphipathic peptides, and discussed above, strongly 
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indicates that this molecule will form an a-helix at the interface. Particularly relevant to this 
work are studies on peptides similar to LQQLLQQLLQL, but built of L-leucine and L-serine, 
because the conformation of these peptides was shown to be a-helical at the water-membrane 
interface . 41 In this study, we have chosen L-glutamine rather than L-serine as the polar residue 
because the serine side chain cannot form easily hydrogen bonds with the peptide backbone 
which may importantly affect the folding of many amphipathic peptides. Interestingly enough, 
as has been shown by O’Neil and DeGrado , 42 L-glutamine and L-serine are equivalently good 
helix formers. 

Our approach is based on atomic-level moleculax dynamics simulations with the explicit rep- 
resentation of the solvent. The main difficulties of this approach are the limited time-seal e!$ 
accessible to such calculations. For this reason, computer simulations of the mechanism of pep- 
tide folding were limited, until recently, to approximate approaches that relied on an implicit 
representation of the solvent, 43-45 and a simplified description of the amino acid residues. 46-48 
Molecular dynamics simulations with full atomic representation of both the protein and the 
solvent were previously employed only to study unfolding, 49-51 often at unrealistically high 
temperatures chosen to accelerate this process. Only lately, advances in computer technologies 
have made it possible to investigate folding directly using the same methodology. 14,52-54 Most 
notably, pathways to folding of the 36 residue-long villin headpiece subdomain in water into 
a state that resembles the native conformation were observed in a 1 microsecond simulation. 52 
This study required a very large commitment of computational resources of a massively paral- 
lel supercomputer, clearly demonstrating that even relatively simple folding problems remain 
a challenge. 

For systems at the water-membrane interface, additional factors complicate the simulations. 
One of these is the slow relaxation of lipid molecules in response to conformational changes of 
the peptide. Furthermore, the high charge density in the phospholipid head group region makes 
it necessary to include explicitly long-range electrostatic effects. Finally, standard molecular 
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dynamics methods may not be appropriate for systems in which the folding of a protein is 
accompanied by changes in the dimensions of the simulation box. Instead, an “extended system” 
treatment may be required. 55 - 56 The dynamics of the protein in the extended system, however, 

is no longer Newtonian, and it is not known to what extent it captures the real dynamics of 
folding. 

Many of the difficulties outlined above can be avoided by considering a water-alkane, instead 
of a water-membrane, system. Such a membrane-mimetic system captures the most important 
characteristic of water-membrane interfaces — the coexistence of a polar, aqueous phase and a 
non-polar medium. In fact, water-alkane or even water-air interfaces have been used occasion- 
ally in both experimental 34,41,57-60 and theoretical 14,61 ^ 54 studies of membrane-active peptides. 
In particular, it has been shown that amphipathic peptides fold into the same secondary struc- 
ture at all three interfaces. 41,57,58 These considerations motivate our choice of water-hexane 
rather than water-membrane as the interfacial system. It should be, however, kept in mind that 
such an approximation has important limitations due to the absence of the many complexities 
of the water-membrane interfacial region. 33,65,66 In particular, electrostatic interactions between 
the peptide and the lipid head groups are neglected, even though they have been shown to affect 
folding of membrane surface proteins, especially if the protein contains charged residues. 67 

The present work can be considered as an extension of our molecular dynamics study on the 
folding of the undecamer of poly-L-leucine at the water-hexane interface. 14 In that study, the 
peptide fully folded into an a-helix in approximately 36 ns. In the simulation presented here, 
complete folding did not occur even though the molecular dynamics trajectory exceeded 160 ns. 
The simulation, nevertheless, sheds an interesting light on the early events of interfacial folding. 

Its results and their discussion are presented after the section covering the methodological 

aspects of the study. The paper closes with a summary and conclusions relevant to systems of 
biological interest. 



Model and methods 


The molecular dynamics simulations were carried out in the microcanonical (N, V, <§) ensemble, 
using the program COSMOS . 68 The interfacial environment of the Ac- and -NHMe terminally 
blocked LQQLLQQLLQL undecapeptide consisted of a water lamella of 1367 molecules in contact 
with a hexane lamella of 284 molecules. Each lamella was in equilibrium with its vapor phase. 
The dimensions of the simulation cell were 45.0 x 45.0 A 2 in the x,y-directk>DS, parallel to 
the interface, and 200 A in the z-direction, perpendicular to the interface. Periodic boundary 
conditions were applied in all three spatial directions. The Newton equations of motion were 
integrated using the Verlet algorithm, 69 with a time step of 2 fs. The length of the molecular 
dynamics trajectory generated to monitor the interfacial folding of the undecapeptide amounted 
to 161.5 ns. In order to maintain the temperature of the system at 300 K, the velocities of 
all the atoms were occasionally rescaled. Such rescaling, necessary to correct for inaccurate 
integration of the equations of motion, was performed only if during a 10 ps time period the 
average temperature departed from the required value by more than ± 3 K. Over 100 ps of MD 
trajectory, the deviation in the total energy was, on average, 0.44 %. 

The water molecules in the system were represented by the TIP4P model. 70 The hexane 
molecules were described by the OPLS potential energy functions, 71 in which methyl and methy- 
lene groups are represented by single, united atoms of the appropriate radius. It has been shown 
that this simple description of aliphatic chains yields good estimates of interfacial thermody- 
namic quantities such as the water-alkane surface tension, 72 or the water-hexane partition 
coefficients of small, organic molecules. 73,74 Although a united atom model generally appears 
to be sufficient for reproducing thermodynamic equilibrium properties, it may lead to overesti- 
mated diffusion rates in the non-polar phase, and, thus, can somewhat influence folding times. 
All intramolecular parameters of the potential energy function describing the undecapeptide 
were taken from the AMBER all-atom force field of Cornell et al . 7S Parameters representing the 
water-peptide and the hexane-peptide interactions were computed from the standard OPLS 
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combination rules. 71 


Pairwise Coulomb and van der Waals intermolecular interactions involving water molecules 
and/or small, electrically neutral groups 76 in the peptide (e.g. -CH 3 , -C=0, or -NH moieties) 
or in hexane molecules were smoothly truncated between 7.5 and 8.0 A, by means of a cubic 
spline switching function applied to both the energy and the forces. 77 ’ 78 The truncation was 
applied to the group centers. Long-range, electrostatic interactions were not taken into account 
in this simulation. This is an acceptable approximation, because the water-hexane interface is 
lacking charged or zwitterionic lipid head groups present in the water-membrane system, and 
the terminally blocked LQQLLQQLLQL is uncharged and not strongly polar. 79 

v 

High frequency degrees of freedom were eliminated by constraining the bond lengths and the 
valence angles of both water and hexane molecules, and the bond lengths between heavy atoms 
and hydrogen atoms in the undecapeptide to their equilibrium values. This was done using the 
SHAKE algorithm. 80 

The simulation was performed on a single R10000, 195 MHz processor of a 10-processor SILICON 
Graphics Power Challenge. The average CPU time per one molecular dynamics step was 
0.51 second, yielding a total CPU time of 476 days for the complete trajectory. 

Results and discussion 


Initially, the peptide was placed in the aqueous medium, near the water-hexane dividing surface. 
Its starting conformation was a /?-strand, which, for the sequence chosen, is not amphipathic. 
The undecapeptide, however, rapidly adsorbed to the interface and its backbone underwent 
conformational transitions that yielded an amphipathic structure, wherein the leucine side 
chains were buried in hexane and the glutamine side chains were immersed in water. As the 
density profiles in Figure 1 indicate, the peptide, once adsorbed at the interface, remained 
in this environment for the rest of the simulation. It exhibited a slight preference for the 


hexane side, perhaps due to the small imbalance in the number of hydrophobic and hydrophilic 
residues. The probability distribution, ^(z), of finding the center of mass of the peptide at a 
given distance, z, from the interface was converted to a free energy profile, AA(.r), along the 
^-direction, using the equation: 


A A(z) = —kgT log &(z) 


( 1 ) 


where kg is the Boltzmann constant and T is the temperature. The resulting curve, shown 
in Figure 2, clearly indicates that the undecapeptide is interfacially active — i.e. tends to 
accumulate at the interface — and suggests that there is a significant free energy penalty for 
its desorption into one of the bulk phases. As can be seen from Figures 1 and 2, however, 
the center of mass of the peptide is not rigidly pinned to the interface, but instead undergoes 
considerable translational motions along z. A similar result was obtained for the undecamer of 
poly-L-leucine at the same interface. 14 


To analyze the structural relationships among the conformations adopted by the peptide during 
the simulation, the distance root mean square deviations (RMSDs) for all pairs of conformations 
were calculated using backbone atoms only. The results are displayed in Figure 3 as a two- 
dimensional graph. In this graph, every pair for which the RMSD was less than 2 A is plotted as 
a dot. The clustering analysis indicates the presence of six distinct structural families, denoted 
(a)-(f), which exist in the following time, t } intervals: 



r o.o 

< t < 

6.2 ns 

(a) < 

10.1 

< t < 

90.0 ns 


[ 103.1 

< t < 

108.0 ns 

(b) 

6.3 

< t < 

10.0 ns 

(c) 

90.1 

< t < 

103.0 ns 

(d) 

108.1 

< t < 

121.0 ns 

(e) 

121.1 

< t < 

149.0 ns 

(f) 

149.1 

< t < 

161.5 ns 


Whereas the first three structural families are somewhat diffuse and show some similarities to 
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each other, each of the last three families appears to show little resemblance to any of the 

remaining five families. Typical conformations from the six families identified in the clustering 
analysis are shown in Figure 4. 

The two-dimensional graph, shown in Figure 3, indicates that the peptide is not conformation- 
ally rigid, but slowly evolves in conformational space. From this graph, however, one cannot 
learn whether the folding pathway of the peptide actually leads to a helical structure. This 
information can be obtained, nonetheless, by correlating the results of the graph with the Ra- 
machandran maps for the (<p,ip) dihedral angles of the 11 residues forming the undecapeptide. 
This is done in Figure 5. In these plots, the size of the points characterizing the conformation 
of a given residue corresponds to a particular structural family emerging from the clustering 
analysis i.e. the smallest, darkest dots denote conformations of family (a), whereas the 
largest, lightest dots represent conformations of family (f). As can be seen, the degree of flexi- 
bility exhibited by the 11 peptide bonds along the backbone differs. Four residues, viz. Leu 1 , 
Gin 2 , Leu 4 and Leu 9 , remain confined to a single quadrant of the ( <f > , ip) map throughout the 
simulation. Three of them, Leu 1 , Gin 2 and Leu 9 , occupy the “0” region, oscillating between 
a ^-strand and an extended conformation. For Leu 4 , only the right-handed, “a R ” quadrant 
is sampled, indicating that, early in the simulation, this residue moved away from its starting 
conformation to ensure the amphipathy of the undecapeptide. Residues Gin 3 , Leu 5 , Leu 8 , Gin 10 
and Leu 11 explore both the “0” and the “a R ” quadrants, but eventually return to the former. 

Only residues Glu 6 and Glu 7 sample the two regions before adopting an a R conformation at 
the end of the simulation. 

The analysis of the Ramachandran maps indicates that, although the peptide adopts different 
structures from several distinct families, none of them resembles closely an ordered conforma- 
tion, such as an a-helix. It, therefore, cannot be simply characterized by just a few parameters. 
Instead, these structures can be well described only by a set of global and local indices that 
complement the maps in Figure 5. 


11 


One such global index that provides a comprehensive, characterization of changes in the shape 
of short peptides during folding can be obtained from their radii of gyration, {/?£}, defined as: 


R k 0 



k = 1,2,3 


( 2 ) 


where {/*} denote the eigenvalues of the instantaneous inertia tensor of the peptide and M is 
its total mass. From the set {/?£}, a parameter, A 3 , referred to as asymmetry of the peptide, 
can be derived: 81 


A3 — 




( 3 ) 


The asymmetry, A$, can take values between 0 and 1, with 0 corresponding to a perfectly 
spherical shape, and 1 to a linear one. In Figure 6, the time "volution of A 3 is displayed and 
compared to the values representative of the ideal /3-strand, 3io- and a-helices. 82 For the first 
108 ns, when the peptide evolves through structural families (a)-(c), its shape remains close to 
that of the /3-strand, which was the starting point of the simulation. This global characteristic 
agrees with the impression from Figure 4. In the subsequent 40 ns, as the peptide adopts 
conformations from structural families (d) and (e), the asymmetry decreases and fluctuates 
around the value representative of an a-helix. Since many different conformations may corre- 
spond to the same value of A 3 , this similarity does not imply that the peptide becomes helical. 
It, however, suggests that families (d) and (e) may exhibit some structural resemblance to an 
a-helix. This expectation is, indeed, confirmed by examining the formation of intra-backbone 
hydrogen bonds between the carbonyl group of residue i — 4 and the amino group of residue i, 
which constitute the scaffolding of the a-helix. As may be observed from the time-evolution 
of all the nine C=0,_.j- • H-N, distances in the peptide, shown in Figure 7, three of them, viz. 
distances 5, 6 and 7, decrease to approximately 2 A between 108 and 120 ns. This clearly 
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points to the formation of three consecutive hydrogen bonds along the backbone near the cen- 
ter of the peptide. These bonds, however, are not rigid but exist in an equilibrium with the 
C=0,_3- • -H-N, hydrogen bonds characteristic of a 3 I0 -helix. 

The impression that the peptide begins to adopt a helical structure is further reinforced by 
examining Figure 8, which shows the time evolution of the distance RMSD between the unde- 
capeptide and ideal 3i 0 - and a-helices. 82 For the first 80 ns, average values of both RMSDs 
remain approximately constant. Upon the transition to family (d), the RMSD with respect to 
the a-hehx, however, decreases substantially, by nearly a factor of two. For the next 40 ns, 
again, the RMSD only fluctuates around an average value, which is close to the RMSD with 
respect to the 3i 0 -helix. This similarity reflects an apparent dynamic inter-conversion between 
the two types of helices in the folded fragment of the peptide. 

Although the nature of structures in families (d) and (e) appears to indicate that the peptide 
progresses towards th. a-helical state, further time-evolution of the system demonstrates that 
this is not the case. After 149 ns, all C=0,_4- • -H-N,- distances become longer than 6 A, as they 
were at the beginning of the simulation. Simultaneously, the asymmetry decreases to nearly 
zero, indicating that the peptide adopts a more spherical shape, and the distance RMSD with 
respect to the a-helix increases to approximately 6 A. All these indices point to a substantial 
conformational change that accompanies the transition from the structural family (e) to family 
(f). The snapshot of a representative conformation from the latter family, shown in Figure 4, 
reveals the nature of this change — the peptide adopts a hairpin conformation with a structure 
similar to a type I /?-turn formed in the middle of the sequence. Since this type of a reverse turn 
is just a deformed 3i 0 -helix, it is expected that a hydrogen bond be present in the middle of the 
backbone. This is, indeed, the case: The fifth C=0,_ 3 - • -H-N, distance in the undecapeptide 
becomes equal to approximately 2 A and fluctuates around this value until the end of the 
simulation. This clearly points to the presence of the required hydrogen bond, supporting the 
identification of the structure as a /3-turn. 
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The observed transition raises a question: Why did the peptide unfold instead of continuing a 
cooperative folding to an a-helix? It should be noted that three consecutive backbone hydrogen 
bonds, present in families (d) and (e) were located in the middle of the peptide rather than at 
one end, as would be expected in standard theories of helix-coil transitions in bulk media. 83,84 
One may speculate that this is precisely the reason for unfolding — the formation of an a- 
helix started from a wrong place and the peptide has to refold completely before it starts 
folding again, presumably from the end. Based on experience from the simulation of the 
folding of the terminally blocked undecamer of poly-L-leucine at the water-hexane interface, 14 
this is not a likely explanation. Poly-L-leucine folded completely to a helical structure, but 
along a pathway that did not start from the end of the peptide and was not sequential. In 
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fact, there is another interesting similarity between the two pathways — approximately half- 
way through the folding process, the already formed hydrogen bonds in poly-L-leucine were 
intermittently broken to create a turn in the peptide. This disordering was also accompanied 
by an increase in the distance RMSD with respect to the helical structures, and a large decrease 
in A 3 . The turri, however, was only a transient state, and, subsequently, the broken hydrogen 
bonds reformed, permitting further folding. As commented on recently by Dobson et al ., 85 0- 
turns or nascent helices can be viewed as initiation sites that increase the probability to coalesce 
towards motifs involving longer range contacts. By analogy, it can be argued that the /?-turn 
in LQQLLQQLLQL is also an intermediate, and, in a longer simulation, the peptide would return 
to its partially folded state. This would be consistent with the concept of the nascent helix, 
an ensemble of transient, partially ordered secondary structures, which rapidly inter-convert 
by way of unfolded states. 86 The study on poly-L-leucine indicates that this concept applies 
to interfacial folding. 14 The presence of a /?-turn among conformations of the nascent helix is 
not surprising. Several amphipathic peptides were shown to adopt this structure at the water- 
membrane interface. 87-90 Furthermore, in water-soluble proteins, /?-turns exist predominantly 
at protein surfaces, in contact with the solvent. 91 It has also been argued that /?-turns are 
common folding intermediates. 92 It is, thus, likely that their stability at an interface may not 



be very different from the stability of short, partially folded helices. 

Although the folding of the undecamer of poly-L-leucine and LQQLLQQLLQL exhibit some 
similarities, there are also important differences, the chief of which is the time-scales of these 
two processes. Poly-L-leucine fully folded in 36 ns, whereas LQQLLQQLLQL partially folded after 
over 100 ns, only to unfold towards the end of the 161 ns simulation. Although single trajectories 
are insufficient to draw conclusions about time-scales of folding events, the difference in these 
time-scales for the two peptides is sufficiently large to infer that LQQLLQQLLQL folds more 
slowly than poly-L-leucine. To understand why this might be so, even though simulation 
conditions and potential energy functions were the same in both cases, it is necessary to consider 
how various characteristics that differentiate the two peptides may influence folding. 

Before we proceed to the discussion of this point, we first consider the hypothesis that an a- 
helix is the most stable structure for LQQLLQQLLQL. Although there is no experimental data 
for this peptide, results for other amphipathic peptides, presented in the introduction, support 
this assumption. A thorough test in computer simulations, at least to within the accuracy of 
the potential energy functions, would require calculating the free energy of the a-helix relative 
to other conformational states or the peptide. This daunting task has not been carried out 
m this study. Instead, a separate, 11.3 nanosecond-long molecular dynamics trajectory was 
obtained starting with an equilibrated a-helical conformation of the undecapeptide placed at 
the water-hexane interface. For the entire length of the simulation, the peptide remained in an 
amphipathic orientation, such that the leucine and the glutamine side chains were immersed 
in hexane and water, respectively. The distance RMSD between the simulated peptide and 
the ideal helix remained quite small throughout the trajectory, and was equal, on average, 
to 0.53 A. Similarly, A 3 was equal to 0.172, close to the value of the ideal a-helix, with the 
standard deviation of 0.003. This deviation is much smaller than the standard deviation for 
the peptide in the 161.5 ns simulations, equal to 0.045, This, in turn, indicates that the a- 
helix is markedly more rigid than structures in the families (a-f), and exhibits no tendency 
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to melt, which would be manifested by an increased flexibility. Although the length of this 
simulation is not sufficient to draw a firm conclusion that an at — helix is the preferred structure 
at the interface, the remarkable stability of this conformation during the simulation certainly 
supports such an assumption. 

Another view on the stability of the at — helix can be obtained by calculating the free energy 
of unfolding a single residue at both the N— and the C— terminus ends of the undecapeptide 
i.e. the free energy for going from a right-handed at — helical to a /?— strand conformation. 93 
The starting point for this series of calculations was an a-helical conformation at the water- 
hexane interface obtained at the end of the 11.3 ns simulation described above. The free 
energy of unfolding was evaluated <*s a function of the ip dihedral angle, using the “umbrella 
sampling'’ method. 94 In the case of the N-terminus, five overlapping “window's” 95,94 were needed 
to explore the range of values accessible to ip, i.e., -60.0 < ip < 160.0. At the C-terminus, four 
windows were necessary to examine the same range of ip. The lengths of sampling trajectories 
in each window varied between 1.2 and 3.0 ns, yielding a total simulation time of 11.5 and 
7.0 ns to estimate the free energy of unfolding at the N- and the C-terminus, respectively. To 
restrain the values of ip in a given window, a harmonic potential was applied. Furthermore, 
a biasing potential was added to yield a more uniform probability distribution of the dihedral 
angle within each window, thereby improving the accuracy of the calculations. The complete 
free energy profiles at both ends were obtained by matching in a self-consistent manner the 
individual curves in the overlapping regions, using the weighted histogram analysis method 
(WHAM). During this set of simulations, the remainder of the undecapeptide was restrained 
in an a-helical conformation by means of soft harmonic potentials. 

As can be observed from Figure 9, the a-helix is considerably more stable than both con- 
formations unfolded at the ends. The two termini, however, are not equivalent. The free 
energy difference between the a-helical and the /3-strand conformation of the peptide bond 
is markedly smaller at the C-terminus than at the N-terminus, viz. 4.75 and 8.66 kcal/mol, 
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respectively. This suggests that the peptide structure is more flexible at the N-terminus than 
at the C-terminus. This agrees with the Ramachandran maps for residues Leu 1 and Leu 11 , 
shown in Figure 5 - Leu 1 stays in the “/?” quadrant over the whole 161.5 ns of the simula- 
tion, while Leu 11 explores both the “o R ” and the “/?” quadrants. Moreover, the free energy 
barrier for rotation from a R to (3 is approximately 10 kcal/mol. If barriers between other con- 
formational states are similar, it may explain why the undecapeptide evolves very slowly in 
its conformational space at 300 K. The results from the calculations presented here should be 
taken cautiously, however. Simulations of unfolding of poly-L-leucine from the N-terminus in 
water, 93 using the Cornell et al. potential energy functions 75 also yielded a high free energy dif- 
ference between the folded and unfolded states, together with a significant free energy barrier. 
Yet, once the potentials were modified to reproduce relative energies of different conformations 
of short peptides in vacuum, obtained from ab initio quantum mechanical calculations, 97 both 
the free energy differences and the barriers were markedly reduced. 

Returning to the issue of slow conformational evolution, we note that one structural feature 
distinguishing LQQLLQQLLQL from poly-L-leucine is the ability of the glutamine side chains 
to form intramolecular hydrogen bonds with the polar moieties of the backbone. Once such 
bonds are formed, the hydrogen-bonded centers in the backbone are no longer available for the 
through-backbone interactions necessary to stabilize helical conformations. The time-evolution 
of the number of backbone-side chain hydrogen bonds is displayed in Figure 10. As can be 
seen from this figure, such bonds are almost always present and, occasionally, as many as four 
of them exist at the same time. The most common are those formed between the -NH 2 group 
of the side chain and the carbonyl group of the backbone in the adjacent residue. In contrast, 
bonds to the carbonyl group of the same residue are scarce. Hydrogen bonds between the -C=0 
moiety of the glutamine side chain and the amino groups of the backbone are equally marginal. 
Backbone-side chain hydrogen bonds frequently exist at the N-terminus of the peptide, but 
only rarely at the C-terminus. Specifically, 39% of them form between -NH 2 moiety of Gin 1 


and the carbonyl of the -Ac capping group, and an additional 24% is formed between the -NH 2 
group of Gin 3 and the carbonyl of Leu 2 . A similar tendency towards the formation of backbone- 
side chain hydrogen bonds at the capping positions was also noted for water-soluble proteins. 98 
In the context of the present simulation, these hydrogen bonds may explain why the formation 
of a partial helix started near the middle of the peptide rather than at the C-terminus. 

A similar analysis carried out on the 11.3 ns molecular dynamics trajectory for the a-helical 
undecapeptide reveals that the backbone-side chain hydrogen bond population is marginal and 
involves only Gin 2 and Gin 3 . In contrast with the results obtained from the 161.5 ns trajectory, 
only hydrogen bonds between the -NH 2 moiety in the side chain and the carbonyl group of the 
same residue are observed. This, again, supports the assumption than an amphipathic a-helix 
of LQQLLQQLLQL is a stable structure, at least over short time-scales. 

Another factor that may influence the kinetics of a-helix formation is connected with the na- 
ture of folding intermediates formed by LQQLLQQLLQL. The simulation of poly-L -leucine at 
the water-hexane interface revealed that its folding proceeded through a short-lived intermedi- 
ate, the 3io-helix, directly preceding the formation of the a-helix. 14 This is in agreement with 
experimental studies on short poly-L-alanine peptides in aqueous solution carried out by Mill- 
hauser et a/., 99,100 which led to the conclusion 101 that a 3io~helix is an important intermediate 
on the pathway connecting a random coil to an a-helix. A similar conclusion emerged from 
other experimental 92 and theoretical studies. 102-104 The exact role of 3xo~helices in facilitating 
folding is not clear, but it may be important that the formation of a through-backbone hydro- 
gen bond in this structure requires that two consecutive peptide bonds adopt appropriate (4>, 
VO angles, while three peptide bonds must have the correct conformation to form a hydrogen 
bond in an a-helix. Once a 3io~helix is formed, conversion to an a-helix should be highly 
facilitated, both kinetically and thermodynamically. In fact, several peptides have been shown 
to exist in an equilibrium between the two types of helices both in water 105 and in non-polar 
media. 106-110 For LQQLLQQLLQL and other peptides capable of forming amphipathic a-helices, 
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3,0-helices are, however, not amphipathic, and, therefore, may not be stable intermediates at 

interfaces. This inability to reach the 3 10 -helix on the folding pathway may, in turn, impede 
folding. 

The suggestion that a non-amphipathic conformation of the 3 10 -helix may not be a folding 
intermediate raises the question whether the structures encountered during the simulation are 
amphipathic. The density profiles in Figure 1 indicate that this is, indeed, the case. To explore 
this point further, the degree of amphipathy of LQQLLQQLLQL was estimated from its mean 
structural hydrophobic moment, 111 (hh), computed over the complete trajectory: 

1 N 

(Mh) = TjF 22 h i S « (4) 

1 = 1 

where A denotes the number of amino acid residues in the peptide, A,- is the hydrophobicity 
of residue i, and s, is the unit vector pointing from the a-carbon atom of residue i to the 
center of mass of its side chain. Hydrophubicities, A,-, were taken from the set published by 
Wolfenden, et al., 112 but, as observed by Eisenberg et ai, (/* H ) does not depend significantly 
on the choice of the hydrophobicity scale. In Figure 11, the time-evolution of (^ H ) is shown 
and compared to the values characteristic of the ideal /?-strand, 3 10 - and a-helices. 82 The most 
striking feature of this graph is that the average of </* H > over the full MD trajectory is equal to 
0.32, which is very close to the value of 0.34, expected for an a-helix, but quite different from 
0.20 and 0.16, typical to a /?— strand or a 3io~ helix, respectively. It, thus, appears that, although 
the undecapeptide is not in an a-helical conformation, its amphipathy at the water-hexane 
interface is nearly optimal. 

The results, discussed above, lead to a general hypothesis about interfacial folding of LQQL- 
LQQLLQL and other peptides with similar periodicity of hydrophobic and hydrophilic amino 
acids. All of them tend to adopt amphipathic structures. The o-helix is probably the most 
stable of them, but there are many others. Thus, driven by the tendency to maintain orienta- 
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tions in which hydrophobic and hydrophilic residues are exposed, respectively, to the aqueous 
and the non-polar phases, peptides evolve in their conformational space along amphipathic 
pathways. Transitions between consecutive amphipathic structures may, however, require sur- 
mounting substantial free energy barriers associated to the dehydration of polar moieties. In 
addition, folding may be further frustrated by bottlenecks created by the presence of non— 
amphipathic conformations on folding pathways. This would yield a frustrated free energy sur- 
face with multiple, deep, local minima that would impede folding by acting as <4 kinetic traps.” 
A qualitatively similar mechanism has been proposed to explain folding of large, water-soluble 
proteins. 113 

Summary and Conclusions 


The 161.5 ns molecular dynamics simulation of LQQLLQQLLQL in the water— hexane system 
reveal" the importance of amphipathic structures in defining conformational transitions of ttiis 
peptide at interfaces between water and non-polar media. Many such structures are available 
in conformational space and some of them were explored during the simulation. 

The almost complete absence of non-amphipathic conformations suggests that they are strongly 
disfavored at the interface. On this basis, it is hypothesized that the undecapeptide and other, 
similar, short peptides fold in the interfacial environment through a series of amphipathic 
intermediates. In the present simulation, however, the a-helix, which presumably is the lowest 
free energy structure, was never reached. It is further hypothesized that energetic and entropic 
penalties associated with folding along amphipathic pathways markedly impede evolution of 
the peptide in its conformational space. 

Several other factors may be also responsible for slow folding of LQQLLQQLLQL, compared to 
poly-L-leucine, which adopted an a-helical conformation in less than 40 ns. 14 One of them is 
the inability of LQQLLQQLLQL to pass through a 3i 0 -helical folding intermediate, which is not 
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amphipathic for this peptide. Another is the formation of hydrogen bonds between the side 
chains of the L-glutamine residues and the carbonyl groups in the backbone. The significance 
of these bonds might be established more clearly by comparing, for example, folding pathways 
for LQQLLQQLLQL and a similar undecapeptide with L-serine substituted for L-glutamine. 

The peptide initially remained in conformations resembling the starting configuration — i.e. 
a /?- strand. Subsequently, it partially folded, forming three consecutive backbone hydrogen 
bonds characteristic of an a-helix, only to unfold into a type I /?-turn towards the end of 
the simulations. By analogy to the simulated folding of poly-L-leucine at the water-hexane 
interface , 14 it may be proposed that these conformations are not just typical structures in the 
ensemble of unfolded states, but represent the nascent helix, which will eventually evolve to 
the fully folded conformation. 


Interfacial folding of amphipathic peptides is of interest not only in the context of water- 
membrane systems, but may also be relevant to iclding of water-soluble proteins. In those 
instances, amphipathic helices do not form fully exposed to water, but rather at the “interface” 
between the solvent and the rest of the protein, accompanied by its hydrophobic collapse. This 
point was illuminated in the recent molecular dynamics simulation of protein folding in an aque- 
ous solution 52 and is further underscored by findings that some peptides which are disordered 
in water in their monomeric form adopt a-helical conformations upon aggregation . 114,115 

To evaluate results of this study one should be cognizant of its limitations. One of them is 
associated with the timescale of the simulation. Even though the molecular dynamics trajectory 
discussed in this study is among the longest obtained for biologically relevant molecules in 
condensed phases, it is clearly not sufficiently long. As has been shown in the example of 
a small protein in aqueous solution , 52 extending the trajectory by an order of magnitude is 
feasible. Yet, even if the peptide folded completely, as was the case for poly-L-leucine at 
the water hexane interface, it still would not be fully satisfactory, since drawing far-reaching 


conclusions from a single trajectory is not justified. Instead, a statistical sample of folding 
pathways should be obtained, but this is beyond reach of current computer capabilities. Along 
these lines, the present simulation should not be treated as an isolated study, but rather as 

a complement to a growing body of experimental and theoretical knowledge about interfacial 
folding. 


Another limitation of the present study is related to the quality of potential energy functions. 
These functions have been calibrated to reproduce either free energy surfaces of very small 
protein fragments or equilibrium structures of water-soluble proteins. Relatively little effort, 
however, has been devoted to parametrizing potential energy functions such that they would 
correctly predict free energy barriers during conformational transitions in peptides and proteins. 

Finally, the water— hexane interface is only a very simplified model of a water— membrane system. 
Several effects that may influence folding, such as specific and non-specific interactions between 
the peptide a„.d the head groups, reorganization of the bilayer and matching between the width 
of the peptide and the head group region, cannot be captured using this model. 

Despite all these difficulties, atomic— level computer simulations remain a promising approach 
to protein folding in solution. Considering the great technical difficulties in experimental in- 
vestigations of early folding events at water-membrane interfaces, such as the formation of 
secondary structures and insertion into the bilayer, these computational approaches become 
especially valuable. In the near future, they are anticipated to provide many valuable insights 
into the mechanism of interfacial folding. 
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Figure captions 


Figure 1. Density profiles of water (solid line), hexane (long-dashed line), the center of mass of 
the undecapeptide (short-dashed line), leucine side chains (dotted-dashed line) and glutamine 

side chains (dotted line) at the water-hexane interface averaged over the complete molecular 
dynamics trajectory. 

Figure 2. Free energy profile for finding the center of mass of LQQLLQQLLQL along the z- 
direction perpendicular to the interface. This curve was constructed using the last 71.5 ns of 
the molecular dynamics trajectory. 

Figure 3. Two-dimensional distance root mean square deviation (RMSD) map for all pairs 
of structures of the LQQLLQQLLQL undecapeptide encountered during the 161.5 ns trajectory. 
Only those pairs for which the distance RMSD was less than 2.0 A are plotted as points on the 
map. Distance RMSDs were computed for backbone atoms only. 

Figure 4. Snapshots of typical conformations of the undecapeptide in each of the families (a)- 

(f) identified on the basis of the clustering analysis in Figure 3. Intramolecular hydrogen bonds 
are represented by dotted lines. 

Figure 5. (<p, xp) Ramachandran maps for the eleven residues forming the terminally blocked 
undecapeptide. Conformations within the same structural family are represented by dots of the 
same size and hue. The size of dots increases along the trajectory, i.e., the smallest, darkest 
dots represent family (a), whereas the largest, lightest dots represent family (f). 

Figure 6. Time-evolution of the asymmetry, A 3 , of the undecapeptide (solid line). Character- 
istic values of A 3 for the a-helix (short-dashed line), the /?-strand (long-dashed line) and the 
3io~helix (dotted line) are also shown. 

Figure 7. Time-evolution of the nine C=0,_ 4 - • H-N, intramolecular backbone hydrogen bonds 
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of the undecapeptide. 

Figure 8. Distance root mean square deviation (RMSD) of the LQQLLQQLLQL undecapeptide 
simulated over 161.5 ns at the water-hexane interface, with respect to an ideal a-helix, i.e. 
<}> = —57°, xf) = -47° (solid line) and 3 10 -helix, i.e. 4> = -49°, = -26° (dashed line). 82 

Figure 9. Free energy profile for unfolding the first, N-terminal (solid line) and the last, 
C-terminal (dashed line) residues in the a-helical structure of the terminally blocked LQQL- 
LQQLLQL undecapeptide at the water-hexane interface. 

Figure 10. Time-evolution of the number of intramolecular backbone-side chain hydrogen 
bonds in the undecapeptide at the water-hexane interface. 

Figure 11. Time-evolution of the mean structural hydrophobic moment (/x H ) (solid line) and 

the average of (/i H ) (long-dashed line) of the undecapeptide. The values characteristic of the 

a-helix (short-dashed line), the /?-strand (dotted line) and the 3 10 -helix (dotted-dashed line) 
are also shown. 
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